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COMMON ANCESTOR TYPE DISTRIBUTION: A MORAN 
MODEL AND ITS DETERMINISTIC LIMIT 

FERNANDO CORDERO^ 


Abstract. We study the common ancestor type distribution in a 2-type 
Moran model with population size N, mutation and selection, and in the de¬ 
terministic limit regime arising in the former when N tends to infinity, without 
any rescaling of parameters or time. In the finite case, we express the common 
ancestor type distribution as a weighted sum of combinatorial terms, and we 
show that the latter converges to an explicit function. Next, we recover the 
previous results through pruning of the ancestral selection graph (ASG). The 
notions of relevant ASG, finite and asymptotic pruned lookdown ASG permit 
to achieve this task. 
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1. Introduction 

A variety of (stochastic) models describes the interplay of mutation and selec¬ 
tion in populations in the forward direction of time, of which the Wright-Fisher 
and Moran models appear as major cornerstones (see, e.g., [HE]). Corresponding 
ancestral processes, starting at present and tracing back the ancestry of individuals 
into the past, are well studied and currently constitute an active area of research 
(dSlEniEIlIIH!). It is a common feature of most of these models that at any time 
there is one individual which is at a later time ancestral to the whole population. 
Such an individual is called common ancestor. 

In this paper we are interested in the type distribution of the common ancestor 
in a 2-type Moran model with mutation and selection and its asymptotic behaviour 
when the population size tends to infinity. For the latter we consider two different 
regimes: the diffusion limit and the deterministic limit. In the first regime, the time 
and the parameters of mutation and selection are rescaled in such a way that, when 
the population size tends to infinity, the proportion of fit individuals converges to 
the Wright-Fisher diffusion. By contrast, when the time and the parameters of the 
model are not rescaled and the population size tends to infinity, the proportion of 
fit individuals converges to the solution of an ordinary differential equation (see 
a)- In this case, we talk about the deterministic limit regime. 

In the diffusion limit regime, the common ancestor type distribution has been 
widely studied (see [HI IHI |H] for the case without mutation and [3] [Ml HB] for 
the general case). Let hoo{x) be the probability that the common ancestor is fit 
given that the current proportion of fit individuals is x. In m. Taylor shows that 
/loo is the solution of a boundary value problem. In addition, he gives a series 
expansion for h^a in terms of Fearnhead’s coefficients (introduced in [^). In the 
recent work [TB]) the authors construct a pruned version of the untyped ancestral 
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selection graph, called pruned lookdown ancestral selection graph (pruned LD- 
ASG). Based on the pruned LDS-ASG, they recover the series expansion for h^o 
in a graphical way. They also show that the Fearnhead’s coefficients correspond to 
the tail probabilities of the stationary number of lines in the LD-ASG. The main 
goal of this paper is to extend these results to the finite and the deterministic limit 
setting. 

In the Moran model of size N with selection and mutation part of the afore¬ 
mentioned results are also available. Let us denote by the probability that the 
common ancestor is fit given that the initial population has exactly k fit individu¬ 
als. In m, is expressed as a weighted finite sum of combinatorial terms. The 
weights are defined through a 2-step forward recursion, and we refer to them as 
Fearnhead-type coefficients. This representation of is unfortunately not closed, 
since one of the equations in the recursion depends on the value h^_i. In order to 
complete the picture we provide an analytical and a graphical approach. 

The analytical approach consists of two main steps. We first characterise the 
Fearnhead-type coefficients as the unique solution of a slightly different recursion 
depending only on the parameters of selection and mutation. Next, using some 
elements of the theory of matrices, we show that the Fearnhead-type coefficients 
correspond to the tail probabilities of some random variable. This approach does 
not provide any extra graphical meaning beyond the results in [14] . 

The graphical approach permits to characterise the as in and simultane¬ 
ously, to recover the results we obtained with our first approach. In the case without 
mutation, this can be done as in m with the help of the ancestral selection graph 
(ASG- [ISl[2n]). In the presence of mutations part of the ASG becomes irrelevant, 
and hence, the problem can not be treated in the same way. We solve the problem 
by means of two ways of pruning the untyped ASG. We define the relevant ancestral 
selection graph (relevant ASG), and then, following the lines of [T5], we extend the 
notion of pruned LD-ASG to the finite population case. Using these constructions, 
we show that the Fearnhead-type coefficients correspond to the tail probabilities 
of, on one hand, the asymptotic number of lines in the relevant ASG, and on the 
other hand, the stationary number of lines in the LD-ASG. Both representations 
provide a graphical explanation to the representation of the probabilities given 
in M- The pruned LD-ASG gives, in addition, a probabilistic interpretation of 
the Fearnhead-type recursion. 

In the deterministic limit framework, we also use two approaches. First, we show 
that the Fearnhead-type coefficients converge to the tail distribution of a geometric 
random variable. We deduce that the probabilities converge to a function h 
which is explicitly computed. In order to provide a graphical interpretation, we 
take an approach similar to the one used in the finite case. The main difficulty 
is that coalescence events are absent in any suitable asymptotic version of the 
ancestral selection graph. Therefore, the notion of common ancestor does not make 
sense anymore. However, the convergence properties of the number of lines in the 
finite pruned LD-ASG give us a way to define an asymptotic version of the pruned 
LD-ASG (in the deterministic limit regime). This new object and the notion of 
representative ancestral type lead to a nice graphical interpretation of the function 
h. 

The paper is organized as follows. In Section we give a short description of 
the 2-type Moran model with selection and mutation, the diffusion limit, the deter¬ 
ministic limit and some known facts about the common ancestor type distribution. 
Section contains the analytical approach to the study of the probabilities 
and their asymptotic behaviour in the deterministic limit setting. Sections and 
l^are devoted to give graphical interpretations to the problems studied in Section 
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1^ In Section we treat the finite case. We first recall the notion of ASG and 
we introduce the relevant ASG and the lookdown ASG. These objects are used to 
obtain the desired graphical interpretations. In Section]^ we give a meaning to the 
asymptotic results obtained in Section 4. The paper ends with Appendices and 
[B] In Appendix]^ we provide some technical results about the Skorohod topology 
which are needed in Section]^ Finally, in Appendix|^we compare the deterministic 
limit, with the asymptotic properties of a related 2-type branching model. 

All along the paper, we use the following notation, for k,m G Nq, with k < m, 
[m\k '■= [k^m] nNo. When fc = 1, we simply write [m] instead of [to]i. 


2. Preliminaries 


2.1. The 2-type Moran model with selection and mutation. We consider a 
population of size N in which each individual is characterised by a type i € {0,1}. If 
an individual reproduces, its single offspring inherits the parent’s type and replaces 
a uniformly chosen individual, possibly its own parent. The replaced individual 
dies, keeping the size of the population constant. 

Individuals of type 1 reproduce at rate 1, whereas individuals of type 0 reproduce 
at rate 1 + sat, sjv > 0. Mutation occurs independently of reproduction. An 
individual of type i mutates to type j at rate ujy Vj, un > 0, Vj G (0,1), uq + vi = 1. 

The Moran model has a well-known graphical representation as an interacting 
particle system (see Fig. [^. Individuals are represented by horizontal lines. Time 
runs from left to right. Each reproduction event is represented by an arrow with 
the parent at its tail and the offspring at its head. We decompose reproductions 
into two kinds of events: neutral and selective. Neutral reproductions are depicted 
as arrows with filled heads and selective ones as arrows with open heads. Neutral 
arrows may be used by all individuals, whereas selective arrows may be only used 
by individuals of type 0. Mutations to type 0 are represented by open circles and 
mutations to type 1 by filled circles. 

More precisely, for each i,j G [N] with i ^ j, and denote two Poisson 
processes with respective rates sn/N and 1/A^. Similarly, for each i G [A^], 
and Xl'^ stand for two Poisson processes with respective rates u^vq and u^vi. We 
assume that all these processes are independent and we refer 


A^ := {A: 


0,N 


\1,N t\A,N , 


A,A| 


je 


[A]/{i}}zG[Ar] 


as the reproduction-mutation process. Now, we draw arrows and circles in the space- 
time coordinate system [0,oo) x [N] as follows. At the arrival times of Xf’J^ and 
Xf'f^, we draw selective or neutral arrows respectively, going from line i to j. At 
the arrival times of A°’^ and Xj’^, we draw respectively open or filled circles at 
line i. So far we have constructed an untyped version of the Moran model. Finally, 
given an initial configuration of types, we propagate the types forward in time 
respecting reproduction and mutation events. Relevant information of the evolution 
of the population is given by the continuous-time Markov chain := 
where X^ counts the number of individuals of type 0 at time t. The infinitesimal 
parameters of are given by 

^ sn) + {N - k)uNi'o if^=l, 

dk,k+e •= S 

[ k -I- fc ttjv M if £ = —1, 


and q^k+i ■= ^ for \l\ > 1. Equivalently, the infinitesimal generator of X^, denoted 
by is given by 


Ax^fik) ■■= qk,k+i if{k + 1) - f{k)) -k qk,k-i {f{k - 1) - f{k)) ■ 


( 2 . 1 ) 
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Figure 1. Graphical illustration of the Moran model: types are indicated for the initial 
configuration (left) and the final one (right). Filled and open circles represent mutations to 
type 1 and 0 respectively. Neutral reproductions are depicted as arrows with filled heads 
and selective ones as arrows with open heads. 


In other words, is a birth-death process with birth rates := and death 

rates := In particular, when un > 0, admits a unique stationary 

distribution, which is given by 

k 

7TxN{k):=CNl[^, kG[N]o, (2.2) 

i=i k-i 

where Cm is a normalising constant (see |5]). When ujv = 0, by contrast, X^ is an 
absorbing Markov chain with 0 and N as absorbing states. 


2.2. The diffusion limit. A natural diffusion limit arises in the Moran model 
when the parameters of selection and mutation satisfy 

lim NuM = dG{0,oo) and lim TVsat = cr G (0, oo). (2-3) 

N—^oc N—^oc 

We briefly recall here its construction (see also m p. 71, Lemma 5.11]). Let 
denote the proportion of fit individuals in the population at time Nt, i.e. 

t>0. 

, the infinitesimal generator of the process Y^ := (y)'^)t>o is given by 

■Ay^fip) =N'^p{l-p) {f (p + N-^) + f{p- N-^) -2/(p)) 

-h [sNp{l-p) + {l-p) UNl'o] (/ {p+ N~^) - f{p)) 

+ N'^PUM yi (/ {p-N-^) - f{p )), 

for every p G Em '■= {i/N : i G [A^jo}- Therefore, if / G C^([0,1]), an appropriate 
Taylor expansion leads to 

lim sup lAy^fip) - AYfip) \ = 0, 

fV-s-oo 

where Ay is the generator of the Wright-Fisher diffusion Y, i.e. 

Ayf{x) := a:(l — x)-j^{x) + [a{l — x)x + 0vq{1 — x) — 6vix\ ^ ^ [ 0 : !]■ 

Consequently, Y^ ———> Y (see [SI Theorems 1.6.1, 4.2.11 and 8.2.1]). 

AT —>00 


From (2.1 


2.3. The deterministic limit. In contrast to the diffusion limit framework, where 
the parameters of the model satisfy (2.31, we consider all along this paper constant 
parameters of mutation and selection, i.e. um = u > 0 and sm = s ^ 0. In 
addition, we do not rescale the time. In jl] it is shown that a deterministic limit 
emerges when the size of the population converges to infinity. We summarize here 
the convergence results related to this deterministic limit. Let := be 
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the continuous-time Markov chain given by := , t > 0. For each zq G [0,1], 

we denote by z{zo, ■) the solution of the following ordinary differential equation 


dz 

— (t) = szit) (1 — z{t)) + Wq (1 — z(t)) — UVizU), t>0. 
at 


XA := 


Eq. (2.41 has a unique stable point which is given by 

if s > 0, 
vq if s = 0, 

where A := (s — 4 supo- In addition, Xq satisfies 

lim z{zq, t) = Xq . 

t—^OO 

It is shown in [H Proposition 3.1] that 

Ve,T > 0 : 


(2.4) 


(2.5) 


( 2 . 6 ) 


lim Zq = zo G [0,1] 

N—^oo 


N—¥00 


lim P ( sup \Z^ — z{zq, t)| > e I = 0. 

and we 


t<T 


Now, we set g{x) := —(2 -|- s)x^ -I- (2 -|- s — u{vq — vi)) x + uvq, x G 
define the Gaussian diffusion := {V^°)t>o by 


:= 




\Jg{^)e J e'^'" dB^ if zq = a;J, 


where (i?t)t>o is a standard Brownian motion. In addition, we introduce the char¬ 


acteristic functions := E 


jeVN{Z^-zizo,t)) 


Assuming that u > 0, [H Theorem 3.4] tell us that 


and := E 


j9vr 


lim \/N{Zq — zq) = 0 lim sup\tpN{t,d) — i^itjO)] = 0. 

N—^oo N^oo i>Q 


As a consequence, it is deduced that 

TTzN 


N—^oo 




(2.7) 


where tt^w denotes the stationary distribution of Z^ (see [H Corollary 3.6]). 


Remark 2.1. Eq. (2.4| describes the evolution of the proportion of fit individuals 
in the well-known 2-allele paralell mutation-selection (see [i p. 265];. 

Remark 2.2. The results presented here were obtained in [1] with the help of 
classical results for density dependent families of Markov chains (see [m nzi [g;. 

2.4. The common ancestor type distribution: known facts. It has been 
shown in |24j that, in the diffusion limit framework, the common ancestor type 
distribution takes the form 


OO 

hooix) = ^ x(l - xfi 

n—0 


( 2 . 8 ) 


where the coefficients (an)n>o satisfy the following second-order recursion 

(n-h 0i^i)an — (n-h cr-h d)an-i-h cran -2 = 0, n > 2, (2.9) 

with boundary conditions oq = 0 and lim„_>.oo On+i/cUn = 0. The (a„)„>o were 
introduced in (Hj and we refer to them as Feamhead’s coefficients. In the recent work 
|18j . the authors show that = P{Lao > n), where L^o denotes the stationary 
number of lines in the lookdown ancestral selection graph. In addition, a graphical 
proof of (2.8) is provided. 
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In the finite population case, a representation of the common ancestor type 
distribution, similar to ( 2 . 81 , is given in M- More precisely, a first-step analysis 
applied to the probabilities leads to 


h 


N 

k 


k 

N 


N—k n—1 


n—0 


j=0 


N — k — j 

iv-i-j’ 


k G [lV]o, 


( 2 . 10 ) 


where the coefficients satisfy, for n G — 1 ] 2 , 
a^ = l. 




n 

N 


N-in-1) 

w~ 


N 


s + M a„_i 


TV - (n- 1) 

N 


^n-2' 


(Eo) 

m 

(En) 


3. The common ancestor type distribution: an analytical approach 

As in the diffusion limit setting, in the 2-type Moran model of size N subject 
to selection and mutation, at any time t, there is a unique individual that is, at 
some later time v > t, ancestral to the whole population (this result follows in the 
diffusion limit from [13 Theorem 3.2]). To see this, we fix t > 0 and we define the 
offspring-type process 0*’^ := ({6'-’^(T)}ig[Ar], {j-’^(u)}jg[Af])„>o as follows. 

• For i G [A^j, 0‘’^(O) := {f} and j-’’^(0) denotes the type of the individual at line 
i at time t. 

• For u > 0 and i G [N], 6\’^{v) holds the set of lines occupied at time t v hy 
descendants of the individual located at line i at time t. We denote by (v) the 
type of the individual placed at line i at time t v. 

The offspring-type process is a continuous-time Markov chain with state space 

EN:=r4[N]) X {0,1}"^, 

where V^{[N]) := {{AJ.gjjv] ; Vi j, Ai c [A^j, A,f]Aj = UfcG[iv] 

We point out that 0*’ ^ can be constructed using the reproduction-mutation process 
defined in Section |2.l| or by exhibiting its transition probabilities. The set 

'■= {({^i}iG[IV]5 {jz}iG[Ar]) ^ '■ 3j G [Ai], Aj = [A^]}, 

is a closed set of 0*’^. From any state outside of Sjv, 0*’-^ reaches Sjv with positive 
probability and the state space is finite. Therefore, the probability of absorption in 
Sjv is equal to one. This means that the offspring of one of the individuals at time 
t will fix at a later time. Such individual is called the common ancestor at time t 
and we denote its type by if. The lineage of these individuals over time defines the 
so-called ancestral line (see Fig. [^. In this section we recall well-known facts about 


1 
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t — to t — ti t — ^2 ^ — ^3 

t 

Figure 2. Ancestral line corresponding to Fig. ^ Solide lines represent the descendants 
of the common ancestor at time ti in the time period The bold line represents 

the ancestral line between to and t 2 - In particular, under the initial configuration of figure 
^ we have if^ = 0, =1,4=1. 
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the common ancestor type distribution in the finite case, and we derive new results 
reinforcing them. Next, we analyse the asymptotic behaviour of this distribution in 
the deterministic limit setting. With this in mind, we define as the probability 
that the common ancestor at time 0 is of type 0, given that the initial population 
contains exactly k fit individuals, i.e. := P{I^ = 0 | = k). Equivalently, 

is the probability that an initial population of k fit individuals is ancestral to 
the whole population at some later time. 

Remark 3.1. For all t > 0, the processes and 0°’^, under the same initial 
configuration have the same law. In particular, P{I^ = 0 | = k) = h^. 


3.1. The case without mutation: s > 0 and u = 0. In this case all the involved 
quantities can be explicitly computed. Indeed, since there is no mutation, h^ is 
the fixation probability of type 0 given that in the initial population there are k fit 
individuals. Thus, setting := inf{t > 0 : X^ = k}, [BJ Theorem 6.1] yields 


h^ = P 


rpN ^ rpN 




(! + »)»-> (! + ,) 


N 


l-(l + s) 


N 


k e [N]o. 


In other words, if denotes a geometric random variable with parameter s/(l + s) 
conditioned to be smaller or equal than N, then 


= p{g^ < k). 


(3.1) 


Since, Q ~ Geom(s/(l + s)), we get 

Af —>00 


h^ -^l-(l + s)-'=, fc>0. (3.2) 

Af—>-oo 

This means that the probability h^ converges to a strictly positive number, al¬ 
though the initial proportion of fit individuals becomes negligible when N tends to 
infinity. Moreover, we also have 

N>kN --^ 00 ^ hi --^ 1. (3.3) 

N—^oo N^oo 


These remarkable facts are in contrast with the behaviour of the corresponding 
probabilities in the neutral case, where h^ = k/N. 

On the other hand, if Ljy is a binomial random variable with parameters N and 
s/(l -I- s) conditioned to be strictly positive, the right hand side of (|3.1[) becomes 


N-l 

N fN\ _ /N-i\ N S ii) 

P{G^ <k) = Y^ P{L^ = i) ^ ^ ’ = £ P(Ta = ^) 

^=1 \k) \k) 


N-k 


T^/ T \ ^ TT ^ ^ - j 

j=0 ^ 


n—0 


Therefore, defining a„ := P{L]si > n), we have 


N-k , n-l , 

n and 

n—O j—0 


-.N 


N —^00 


A 1. 


Remark 3.2. The convergence results (3.21 and (3.31 also differ from the diffusion 
limit case, where (see na and m) 


kN 

~w 


N—^00 


> x e [0,1] ^ hi 


-)• 

N —^00 


1 - 

1 - ■ 
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3.2. The case with mutation. Henceforth, we assume that s,m > 0. In contrast 
to the previous case, when we introduce mutations, there is no fixation of types 
anymore. Thus a different approach has to be taken. The problem was solved in 
[H] by studying the Markov process := {D°’^, , X^), where Dl’^ holds 

the number of descendants of type i at time t of an unordered sample with initial 


composition {Dq’^, Dq^'' ). The relation between and is given by 




= P 3t>0: D 


0,N 


Dl’^ = 


N \ = {k,0,k)y ke[N]o. 


This representation and a first-step analysis were u sed in m in order to obtain 
(2.101 and the recursion {£^1^=0 defined in Section 


2.4 


3.2.1. A first characterization of the coefficients . The inconvenience in the re¬ 


cursion is that it depends on the value of hflf_-^. Equation (En\ is a 

consequence of a probabilistic argument. By contrast, {Eq\ and {Ei\ follow di¬ 
rectly from (2.101, plugging in k = N and k = N — 1 respectively, and they are 
particular cases of the next result. 

Lemma 3.3 (Inversion formula). For all i & [A^], we have 

N 




Proof. From (2.101, we see that 

N 




k=£ 


k-1 

V ^-1 




N /r, i\ 


k=e 

N-e 


N-£ N-n , 


n—O 

N 


k^£ 


N 

a 

n—0 

k-i 

\£-l 


N-n-l 
k-1 

N — n — 1 
k-1 


J 

1 


m — 1 

j 


= E E 

m—i j—i—1 

Thus, the result follows from the following combinatorial identity (see e.g. 

m— 1 

E 

where denotes the Kronecker delta. 


e-i 


J 

1 


m — 1 

j 


□ 


The aim now is to replace Equation {Ei\ by another one, independent of (Eq) 
and {En ), for n S — 1 ( 2 , and not involving the values of h^. 

Lemma 3.4 (The missing equation). We have 


(' + “+v) 


'‘N-l 


N 


‘-N-2- 


{En) 


Proof. First, let us denote := h^ — From Lemma 3.3 and the following 


combinatorial identities, which can be derived from the binomial theorem: 


N 

E(-i) 

A .-1 

we deduce that 

N 




N 


A: = 0 and y^(—1)^ 


/c=2 


{k — 1)A: = 0, 


a 


-i-B-i) 


fc+1 




N 


and 




/c=2 


(fc-W- 
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The previous identities and the definition of and lead to 

N 

/ <3 \ 

An ■ = 


(l + u+ —^ Un-i - ^aw-2 - ^1 + ^ + 

N 


s k 

Y 


k=l 

Defining ipQ := -ip^ := 0, and, for k € [fV — 1], := 

N , 7 V ^-1 




N -k 


k 


E(-i) 


fc+i 


and 


fe=i 


N 


¥k 


K 


N-k 


= Nj2i-l) 


fe + 1 


E(-i) 




fc = l 


N -1 


= ivE(-i) 


fe+i 


k=l 

As a consequence, we obtain 

N-l 


k=l 


N-l 

k-1 

N-l 

k 


JN\N 
Yk Afc , 


i’kk^k- 


Atv 

1C 


= E(-i) 


fe+i 




N-l 

k-1 


N-l 




EM) 


fc+i 


k=l 


N- 1 
k 




(3.4) 


In addition, we know from [m Eq. (25), (26) and (27)] that, for all k G [N - 1]: 


(A^ + M = Af+i V'Ci + Ml Ml + ^ 


(3.5) 


Multiplying (3.51 by ( —1)^"'’^and performing the sum over k G [fV — 1] yields 

N-l /,r N-l 

E(_i)fc+ii 


N-2 
k-1 


N-2 
k-1 


A^+iMi 


\fc+i 


N -2 
k-1 


i(Af+ = E(-i)'^" 

k=l ^ ^ k=l 

The last sum equals zero as a consequence of the binomial theorem. Rearranging 
the sums, we obtain 

N-l 7 V -1 1 \ 

^ = 0 . 




k=l 


This identity together with (3.41 implies that Atv = 0 and the proof is completed. 

□ 


The next result tells us that the coefficients {a^)n=o are characterised by the 


equations (|^ and <\EnlnelNh- 

Lemma 3.5 (Uniqueness and positivity of the coefficients). The system of equa¬ 


tions given by (Eq\ and (£’nlnG[Af ]2 a unique solution , which is in 


addition, coordinate by coordinate, strictly positive. 


Proof. We first write the underlying system of equations as follows: 


/do Co 

0 


0 \ 


( ^0 \ 


fX 

bi di 

Cl 




< 


0 

0 



0 



= 


Vo ••• 

bN -2 

0 

dN -2 

bN -1 

CN -2 

dN-lJ 


^N -2 

V®iv-i/ 


0 

Vo/ 


(3.6) 
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where do := 1, cq := 0, 


dn, ■ — 


n + 1 
N 


(N — n)s 

N 


^77, ■- 


{N — n)s 
N ’ 


n G [TV - 1], 


and 


R + 1 

N 


— wi, n G [A^ — 1]. 


The matrix Aj\! in (3.6 1 is strictly diagonally dominant, and hence invertible by the 
Levy-Desplanques Theorem (see for ex. [TTJ Theorem 6.1.10]). Thus, (3.6 1 has a 


unique solution given by the first column of . It remains to prove its positivity. 

Since Gljv is strictly diagonally dominant and all its diagonal entries are strictly 
positive, the Gerschgorin circle Theorem implies that all the eigenvalues of have 
strictly positive real parts (see mi Theorem 6.1.1]). In addition, Aj^ is tridiagonal 
and bi Ci_i > 0, for all z G [TV — 1]. Therefore, all its eigenvalues are real (see [TTl 
p. 174, Problem 5]). Summarizing, all the eigenvalues of Ajv are real and strictly 
positive. The same holds for the sub-matrices A^^'^ consisting of the first n rows and 
columns of A^ (with A^^'^ = A^)- Then, for each n G [TV], 9^ := det(>lj^^) > 0. 

Furthermore, since Ajy is tridiagonal, m Theorem 2] yields 

i^N^)hi = % and (A-i),,! = 16i X •• • X 

On On 

where 4>n+i — 4’n+2 — and from Lemma 2] 


'-'N+2 
qN I N 


4>k+l — + dkCk-l9}^_i4>k+2^ ^ S [-^]- 

This recursion also tells us that the coefficients are all positive, and therefore 


the same holds for the first column of ^4^^. 


The proof is completed. 


□ 


Proposition 3.6. There is a random variable Ln with values on [TV] such that 
a^ = P{LN>n), nG[TV-l]o. 

Proof. We claim that the function n i—>■ is decreasing. If this is true, we define 
:= a^_i — a^, n G [TV — 1] and Pn '■= cln-i, and deduce, for all n G [TV], that 

N 

p^ >0 and '^P^ = = 1. 

fc=i 


Consequently, there is a random variable Ln with values in [TV] satisfying that 
P{Ln = k) = Pk- The desi red result follows. It remains to prove our claim. 

From { EnI and Lemma 3.5 we have a )^_2 —a)v-i — addition, for n G [TV—1]2, 

A^- (n- 1) 


f ^ I \ / N N \ N , 

VTV J ~ ^ wi^oa„-i + 


N 


s(aC_i-a^2), 


and the claim follows using a backward induction. 


□ 


Let now (^i, ...,^^) be a vector of random variables with values in {0,1}'^ with 
the following distribution 

(Uu -7 Cjv) = (ii,m)) ■■= ^ ,N ^ —if 

V r j i—1 

and define := min{z > 1 : Ci = 0}- 

Corollary 3. 7. For all k G [TV]o, we have hk = P{G^ < k). 
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Proof. The proof is very similar to the case without mutation. Indeed, we can 
decompose the distribution of in the following way: 

N / N \ / N 

p{G^ = = = ^ 

l=l \i=l / V i=l 

N /N\ _ fN—l\ N—k tN—n—l\ 

= ^ P(L^ = £) ^ ^ > n). 

e=l \k) n=0 \k) 


The result follows from Proposition 3.6 and Equation (2.101. 


□ 


3.2.2. Asymptotic behaviour of the probabilities h^. In order to understand the 
limit behaviour of the common ancestor type distribution, we first study the coef¬ 
ficients ia^)nPo. Let us assume for a moment that these coefficients admit a limit 
when N converges to infinity. In this case, if we fix n > 2 and we take the limit 
when N tends to infinity in (P„ I, we see that the limit coefficients {ak)k>o should 


satisfy the following recurrence relation 


0 = uui o„ - (s -I- u) a„-i + s an- 2 - 


(3.7) 


Lemma 3.8. The solution of (3.7) has the form 


— 


{aoi--ai)£l {ai-£+ao)r 


yjhere £± = —■ 


Proof. Note that (3.7) is a homogeneous linear recurrence relation of second order 
with constant coefficients. Thus, its solution has the form a„ = ci -I-C 2 , where 

£+ and £- are the roots of the polynomial p{x) = uvix^ — (s -|- u)x + s and ci, C 2 
are constants. The previous equation for n = 0 and n = 1 permits to determine 
the values of the constants ci, C 2 in terms of oq and oi. The result follows. □ 


Remark 3.9. Note that £j^ = 1/(1 —Xq) G (1, 00 ) and £_ = s{1 — Xq)/uvi G (0,1), 


where Xq is defined in (2.5). 

Proposition 3.10 (Convergence of the coefficients). For all k G [fV]o, we have 




N^oo 




Proof. Since = 1 for all TV > 1, the result is true for k = 0. If we prove the 
result for k = 1, then using the recurrence relation (En), we can deduce that, for 
each k > 0, at, where {ak)k>o is the solution o^ ^ 

N—¥00 _ 

this solution is given by au = £^ and the result follows. 


N—^OO 

oi '.= £-. From Lemma 
Therefore, it remains only to prove that a 


3.8 


N 


N^oo 


> £-. From ( Ei ) and denoting 


~ ~ Nt have to show that Nip^_ 


-g£_. 


N—¥00 
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From [H Eq. 30] j ( |2.2[ ) j cLnd iisin^ th&t tt^n (k/N) = TTx N (^k'j^ wG obtcLin 


N 


Ni^N-l = 1^ 


N 


{N - 1)^1 




J2 {N -n)n{^+Wi)TrzN{^) 
1=1_ 

N 

E +uiyi)TrzN{§) 




^ N 


N 

E(i-^)f(i-f + «^i)^^«(^) 

n—1 

N 

E f (1 - f +w^"i)7rz«(#) 

n—1 

^ [zf (1 - zna - zf+ W1.1)] 




N 


+ wi)] 


Thus, 


2.7 


yields iViATv-i 


-> s(l — Xq)/uvi = £_, ending the proof. 


□ 


N—¥00 

Remark 3.11. The previous result and Remark 3.9 yield Oi = s(l — This 
expression is similar to its diffusion limit analogue, where ai = ct( 1 — a;)/(l + 
with 9 and a as in Eq. (2.31, x = [y^(l — Y)]/— Y)], and tty is the 

stationary distribution of the Wright-Fisher diffusion (see m)- 


(d) 


N —^00 


Corollary 3.12. We have Lm 
with parameter 1 — i-. 

Proof. Direct from Proposition |3.10| 


> L, where L is a geometric random variable 


□ 


We have all the ingredients to prove the convergence of the probabilities h^. 

Theorem 3.13 (Convergence of the common ancestor type distribution). Consider 
a sequence of integers (fcjv)Ar>i satisfying, for all N > 1, kx & [-^]o- Then 


.. kx 

lim 

AT—>-oo N 


= x G (0,1) ^ lim /if = h{x) := x + x'^r{l-xff = ^ -EE-- 

Af-loo ™ ^' 1 — 1-(1 — X) 

n—1 ^ ^ 


Proof. Fix n G N. Thanks to Proposition 3.10 if k^/N converges to a; G (0,1), 
then 


N kN TT IV — fcjV — j _ 

^ 11 ]V - 1 - 7 N^OO 

j=0 ■< 


>rx{i-xY 


Now, let Nq be such that, for all N > Nq, ^ > |. Then, using Proposition 
we get, for all N > Nq, 


3.6 


j=0 


('-r- 


The result follows as an application of the dominated convergence theorem. □ 


Corollary 3.14. ITe have ^ 


(d) 


-> Q, where Q is the random variable with values 

N—¥00 

in [0,1] and density function given by 


/eE) := 


l-£_ 


{l-e-{l-x)ff' 


Proof. It is straightforward to show that h{x) = P{Q < x). The result follows from 
Theorem 13.131 □ 
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4. The common ancestor type distribution: the lookdown ASG 

APPROACH IN THE FINITE CASE 


In this section, we extend to the finite population framework the construction of 
the (pruned) lookdown ancestral selection graph (LD-ASG) given in [TH]. Based on 
this construction, we provide a graphical interpretation to the equation (2.101 and 


the recurrence relation (En I. We also give a graphical meaning to the random vari¬ 


able Lm appearing in Proposition |3.6[ We assume in the sequel that the parameter 
of selection is strictly positive, i.e. s > 0. 


4.1. The ancestral selection graph. The concept of ancestral selection graph 
(ASG) was introduced in [TS] and m with the purpose of constructing samples 
from a present population, together with their ancestries, in the diffusion limit of 
the Moran model with selection and mutation. We recall here this notion in the 
finite case and we discuss its relation to the common ancestor type distribution. 

Let us start with a given realisation of the untyped 2-type Moran model of size 
N in [0,t], i.e with a realisation of the reproduction-mutation process (A(^)tg[o^T]- 
In what follows, we use the letter t for the forward time and [3 := t — t for the 
backward time. The ASG can be read off as follows (see Fig. [^. We start with a 
sample M C [N] of the population at time /3 = 0 and we trace back the lines of 
the potential ancestors. When a neutral arrow joins two individuals in the current 
set of potential ancestors, a coalescence event take place, i.e. the two lines merge 
into a single one, the one at the tail of the arrow. When a neutral arrow hits 
from outside a potential ancestor, a relocation event occurs, i.e. the hit individual 
moves to the level at the tail of the arrow. When a selective arrow hits the current 
set of potential ancestors, the individual that is hit has two possible parents, the 
incoming branch at the tail and the continuing branch at the tip. The true parent 
depends on the type of the incoming branch (see Fig. |^, but for the moment we 
work without types. These unresolved reproduction events can be of two types: 
a branching event if the selective arrow emanates from an individual outside the 
current set of potential ancestors, and a collision event if the selective arrow links 
two current potential ancestors. The number of potential ancestors decreases by 
one in a coalescence event, increases by one in a branching event, and remains 
unchanged in collision and relocation events. The previous procedure provides, 
at any time fi € [0,t], the corresponding set of potential ancestors of the initial 
sample M, which we denote by (*)|5)) where * stands for untyped and will 

be replaced later by the initial (at f = 0) configuration of types (*>0) = M). 

The untyped ASG in [0,r] of the sample M, -4||,^’^(*), consists of 


/3 = t ^ ^ /5 = 0 


h 


A .^.^. 

r 


A ^ 

) 

r 

m ^ ( 

^ 1 

.. 

•. 



t = 0 _ 

t 


t = r 


Figure 3. Untyped ASG corresponding to Fig. ^starting with the total population at 
backward time /3 = 0. 


(1) the set := U;3e[o..]{/3} x ^ ^ A]- 

(2) the configuration of arrows and circles involving the lines in . 
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When M = [A^], i.e. when we sample the ancestry of the whole population, we 
simply write instead of and instead of u). 

The true ancestry of the initial sample can be derived after assigning types, 
J G {0,1}^, to the individuals at time P = t using the following rule: propagate 
types forward in time in the ASG and keep track of the changes by respecting the 
mutation events. At every selective arrow, the incoming branch is the ancestor if 
it is of type 0, otherwise the ancestor is the continuing branch (see Fig. |^. The 


C D 

1 - 

I 

1 - 


C D 

1 0 — 

/ 

1 — 


0 1 
0 


C D 

0 1 
0 




■0 


Figure 4. Incoming branch (I), continuing branch (C), and descendant (D). The bold line 
represents the ancestor. 


resulting object is the ASG with types and is denoted by (J) (see Fig. j^. For 
V G [0, r], (J, v) represents the set of ancestors at time P = v of the sample M, 

given the configuration of types J at time P = t. As before, and in what follows, 
the upper index (M, N) will be replaced by N when M = [N], 


P = 'T , ^ _ ,3 = 0 



Figure 5. ASG with types obtained from to Fig. 3 after assignment of types at time /3 = r. 


Remark 4.1. In the finite Moran model, collisions occur with strictly positive rate. 
By contrast, in the diffusion limit setting there are no collision events (see m)- 

We denote by the random variable with values on {0,1}^ providing the 
initial configuration of types. Let be the joint law of and , such that 
under P^, is independent of A^ and uniform on all possible configurations with 
exactly k zeros. The relation between h^ and the ASG with types is given in the 
following lemma. 

Lemma 4.2. For all k G [fV]o> we have 

h^ = for some i G [iV], jf' = 0^ 

Proof. Let Tff := inf{u > 0 : 0°’^(u) G An}, where 0 °W jg offspring-type 
process defined in Section with initial configuration given by 8^’^ (0) = {f} and 
^■o.Af(Q) _ jN^ J g other words, is the first time that the progeny of 

one of the individuals at time t = 0 take over in the population. The mentioned 
individual is the common ancestor at time 0. In addition, 

= {z}| = {z is the common ancestor at time 0 , = J^} . 

Therefore, we have 

hk = Pk .Tl) = {*}, for some z G [N], Jf = o) . 
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Note also that, on {r > }, we have 

= {i}} = = {z}} . 

Since in addition, is almost surely finite, we deduce that 

= lim Pfc J^,r) = {i}, for some i G [N], jf = 0, < r'j 

= J^Pk = {j}, for some i G [N], = 0^ , 

and the proof is accomplished. 


□ 


4.2. A Markov version of the ASG and the bottlenecks. An important fact 
is that we may construct the untyped ASG in a Markovian way. By this we mean 
that we can construct a Markov process •= ^ where 

• C [N] represents the set of potential ancestors at time /3 of an initial sample 

M, i.e the analogue of given in Section 4.1 

. := is a col¬ 

lection of counting processes encoding with their jumps the reproduction and mu¬ 
tation events involving the potential ancestors of M. 

To see this, we first consider := (A^)tgR the reproduction-mutation pro¬ 
cess defined on the entire real line, i.e. the graphical representation of the Moran 
model between —oo and oo. Since A^ is a finite collection of independent Poisson 
processes, it follows that 

(Af )te[-.,o] = (Af )te[o..], and A^ := = A^. 

These identities in law motivate the following construction of the process ■ 

We start with the sample M and we read off the configuration of arrows and circles 
given by A^ as follows: 

• if t = —/3 is a jump of the process and j G we have two options: 

* if i G , a collision occurs. We set A^"^ = 

A if t ^ A^l^, a branching occurs. We set A^'^ = A^_l^ U {i}. 

In both cases, we set {M, (]) = {M, P) + 1. 

• If t = —j3 is a jump of and j G A^’^, we have two possibilities: 

* if i G A^’^, a coalescence occurs. We set A^'^ = A^’^/{j}- 

A if i ^ A^l^, a relocation occurs and we set A^'^ = (A^’^/{j})/ U {i}. 

In both cases, we set {M,j3) = + 1. 

• if t = —/3 is a jump of and i G A^l^, a mutation to type 0 occurs. We set 

A"’^ = A^l^ and + 1. 

• if t = —/3 is a jump of and i G A^l^, a mutation to type 1 happens. We set 

A"’^ = Ajl’^ and + 1. 

The so-constructed process x^’^ is clearly a Markov process, and leads to the 
following definition of the untyped ASG of the sample M in the interval [0, r]: 

aM.N / ^ { m,n\ 

(•) - (x, ■ 

We call x^’^ ancestral selection process. Another advantage of this construc¬ 
tion is that, if Ti > T 2 > 0, A^’,^j(*) is the restriction of A|^’^j(*) to the interval 
[0,T2]. Moreover, we can define the untyped ASG in the entire positive real line as 


aM,N , ^ ( M,N\ 

^[o;oo)(*) •= [xp ) 


p>0 
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Given a configuration of types, J G {0,1}^, at time [3 = t, the corresponding 
ASG with types in [0, r], is obtained by propagating types in 

and extracting the true genealogy as before. 

Note that the process := counting the lines in the untyped 

ASG of the sample M, i.e. := \A^'^\, is a birth-death process with rates: 

n 1 i\ k{k — 1) /7 7 k{N — k)s 

g^iv(fc,fc - 1) := - — -, and (fc, fc-h 1) := --. 

As in the diffusion limit setting (see [H]), for r sufficiently large, has 

bottlenecks, i.e. times at which it consists of a single line (see Fig. [m. Indeed, 
define := inf{u > 0 : = 1}. Since jg irreducible Markov 

chain with finite state space, the time is almost surely finite. Moreover, for 

all T > is a bottleneck of A ^’^(*). 


H = T (7-/3. /7 = n 








f 

t- i 


1 ^ 

i 

1 

i t 



_ 

* 1 1 

f 



Figure 6. Untyped ASG with a bottleneck at backward time /3 = /Sq. 


In particular, if we are interested in the common ancestor type distribution, 
instead of following the ancestry of the whole population, we can equivalently follow 
the ancestry of the one individual at the bottleneck . The following lemma 
formalises the fact that sooner or later all the lines in the sample coalesce into the 
ancestral line. 


Lemma 4.3. For all k G [N]o, we have 

for some i G [N], = oj. 


T—¥OQ 

nN 


Proof. Since is almost surely finite, we deduce from Lemma 


4.2 


that 


= lim Pk r) = {i}, for some i G [N], = 0, t > . 


Note that, since is a bottleneck, for r > , the true genealogy of ,.](*) 

in the interval [Tj(^,t] depends only on {x ^)on the configuration of 
types at time r. In addition, the true genealogy at any time in \T^ ^t] consists 
of only one individual. In particular, we denote by *((x^)/ 3 g[t~,t]) the unique 
ancestor of the whole population at time r. Therefore, we have 


= lim Pk 

r—7-00 


tN 






= 0 , 


>Tif 


(4.1) 


Now, we consider := the natural filtration associated to the process 

A^, and we note that is an F-stopping time. Therefore, conditioning on 
and applying the Markov property, we obtain 


Pk 




= 0, T > 


rpN 


= Ek 


l{r>r”} Pk 


r-T, 


'N 


A 


N 


(4.2) 
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where, for d > 0 and j G [N], 




and i((x^'^^’^)/ 3 G[o,?;]i J^) is the ancestor at time v of the individual placed at level 
j at time 0, given the configuration of types . Equivalently, we have 

Wo.if = {*((xf ’'^)/3g[0..], J^)}. 

We conclude that 

Pkiv,{j}) = Pk = {f}, for some i G [N], jf = o) . 

Moreover, due to the exchangeability of the lines, Pkiv, {j}) does not depend on j, 


and then Pkiv, {j}) = Pkiv, {!})■ Plugging this in \A.2\ and using (4.1 1 , we get 
hk = lim Ek (r - . 

T—¥00 L ^ J 

Since is almost surely finite, we achieve the proof applying the dominated 
convergence theorem. □ 


Remark 4.4. Due to the exchangeability of the lines, the relocation events do not 
affect the common ancestor type distribution, and therefore, can be ignored. 


4.3. Classification of paths in the ASG. The purpose of this paragraph is to 
better understand the composition of the untyped ASG and the passage to the 
ASG with types. Moreover, we would like to discriminate between the relevant and 
the irrelevant information provided by the ASG. The discussion presented here will 


serve also as a motivation to introduce new objects encoding more efficiently the 
common ancestor type distribution. With this purpose, we fix r > 0, M C [N] and 
a realisation of (*). We call a path in (*) a subset of U/ 3 g[o ^ A^"^ 
of the form 


7 := 


IJ [rfc_i,Tfe) X {ik} 


[{^m} ^ ; 


A -1 

where 0 = tq < • • • < = t, G [N], and for all ik yf ik+i, Tk is a jumping time 

of or i.e. there is an arrow going from ik+i to 4 - 

The path 7 is said to be neutral if it uses only neutral arrows. If the path is not 
neutral, we denote by < • • • < , the times where the selective arrows appear, 

and := r. We call 7 almost neutral if it is not neutral, there is at least one 

mutation in , r], the first mutation after being to type 0, and on each interval 
[rfc, containing mutations, the first mutation after t/j. is to type 0. We say 

that 7 is fictitious, if there is an interval containing mutations, and such 

that the first mutation after Tk- is to type 1. Finally, we say that 7 is truly selective 
if it is nor neutral, nor almost neutral and nor fictitious. Equivalently, 7 is truly 
selective if there is no mutation in r], and the restriction of 7 to [ri, TkJ\ either 
has no mutations or is neutral or almost neutral. Examples of this classification of 
paths are illustrated in Figure]^ 

Fictitious paths can not be used, independently of the configuration of types at 
time T. Hence, fictitious paths are part of the irrelevant information in 
In order to identify all the irrelevant material, we give a second classification of 
paths. 

From the construction of the untyped ASG, paths are never hit by a neutral 
arrow. In an almost neutral path, selective arrows are always used. We call 7 
irrelevant if it is fictitious or if it is hit by an almost neutral path. We say that 7 
is relevant if it is not irrelevant. Irrelevant paths are never used, independently of 
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Truly selective path : 

-2 -•- 


Neutral path : 



Almost neutral path : H- 


Figure 7. Examples of neutral, almost neutral, fictitious and truly selective paths ex¬ 
tracted from Fig. 


the configuration of types, i.e. they contain irrelevant information on (*). On 

the contrary, for a relevant path 7 , there is always a configuration of types J such 
that 7 S Motivated by this fact, we define the relevant untyped ASG as 

^[ly’rT■= f"! ^ : 7 is a relevant path}, (4.3) 

and from the discussion above, we see that 

U (4.4) 

A relevant neutral or relevant almost neutral path, which is hit only by irrelevant or 
truly selective paths is called immune. Immune paths are exactly the true ancestors 
of the sample M, when the configuration of types at time r consists of only ones. 
In particular, there are at most \M\ immune paths in 7l^’^(*). 


4.4. The case without mutation. We assume in this section that u = 0. In this 
situation, as in the diffusion limit case (see [ 22 ), a natural link emerges between 
the distribution of the common ancestor and the stationary number of lines in the 
ancestral selection graph. 

Thanks to Lemma |4.3| it is sufficient to analyse the ASG starting with a single 
individual at level 1. Moreover, since there are no mutations, we have only neutral 
and truly selective paths in 7l^g^^j^(*). In addition, we have exactly one immune 
path, which is neutral. All the other paths are truly selective and hit the immune 
path at some time in [0,r]. Therefore, the immune path is the common ancestor if 
and only if all the individuals in are of type 1. Since truly selective paths 

can be only used by type 0 individuals, we deduce that the common ancestor is of 
type 0 if and only if one of the individuals in aV^’^ is of type 0. Thus, conditioning 
on the number of lines in a\A'^^ we get 

Pk (Jf = 0 , where: r) = {*}) = ^ = i) - 1 - ^- 1 . 

i=l \k) 


Taking the limit when t tends to infinity in the previous expression and using 
Lemma |4.3l we obtain 

N — k — j 
N-l-j' 


kt=E p{ks'-^ = t) J. “ ’ = >") n 

£—1 \k) n—0 j—0 


Using the detailed balance equation, it follows that the stationary num¬ 

ber of lines in the untyped ASG, is distributed as a binomial random variable with 
parameter N and s/(l -I- s) conditioned to be strictly positive. Thus, we have re¬ 
covered the results of Section |3.1| and established that the random variable Ln 
corresponds to the stationary number of lines in the ASG. 
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4.5. The case with mutation: the relevant ASG. From now on, we assume 
that s,u > 0. In this case, as discussed in Section [4.3| paths which are never 
used appear in the untyped ASG. The untyped relevant ASG defined in Section [F3] 


permits to obtain a first graphical interpretation to Eq. (2.101. 

We denote by the number of lines at time t in the relevant untyped ASG, 
^[Ot]^' graphical representation of is given in the next lemma. 

Lemma 4.5. The random variables {R^)r>o converge in distribution to a random 
variable R^ with values in [A] and 


N-k n-1 ,, , 


n—0 


j=0 


A-l-j 


Proof. From Lemma 4.3 and Eq. (4.4), we deduce that 




= lim Pk 

T—¥(X> 


,t) = {i}, for some i G [N], = 0^. (4.5) 


From the definition of we see that it consists of one immune path and 

truly selective paths hitting the immune path at some time between 0 and t. If 
consists of only ones, the ancestral line is the immune path and the type of the 
true ancestor is 1. If is not identically one, we have two possibilities: (1) there 
is only one 0 in at the immune path, and the ancestral line is the immune path, 
or (2) there is at least one 0 in at a truly selective path, then the ancestral line 
is the truly selective path with type 0 at time t, which is not hit by another truly 
selective path with type 0 at time r (there is always such a path, since the number 
of truly selective paths is almost surely finite). In both cases, the true ancestor is 
of type 0. In conclusion, the true ancestor at time r is of type 0 if and only if one 
of the lines in the untyped relevant ASG is of type 0 at time r. We conclude that 


{P 


N 

= 0, where: J^, r) = {i}) ='^P{R^ = £) 


Ck) - iV) 




(4.6) 


Let (i?(y)„>o be a subsequence of {R^)t>o. Since the involved random variables 
share the same finite state space, we conclude that (i?)y )„>o is tight. Therefore, 
there is a subsequence {R^ )k>o which is convergent in the weak sense. We denote 
its limit by R^. Using this and Equations (14.51) and (14.61), we obtain 


N 


h^ = '^P{RZ=i) 

i=i 




(N-t 
V k 


Q 


, N-k 

n=0 


> n 


n—1 

)n 

i=o 


N — k — j 
N-l-j- 


Lemma 


3.3 


implies that the law of R^ is uniquely determined by the common 
ancestor type distribution. Since this holds for any subsequence of {R^)r>o, the 
result follows. □ 


From the previous lemma, the random variable Lat given in Proposition |3.6| 
corresponds to the asymptotic number of lines in the relevant untyped ASG. Un¬ 
fortunately, it is not easy to describe the law of (R^)- Therefore, we use a different 


approach in order to give a graphical explanation to the recurrence relation ( E., 


4.6. The case with mutation: the lookdown ASG. The recent work 


provides a graphical interpretation to Eq. (2.10) and to the recurrence relation 


(En ), in the context of the diffusion limit. This is done with the help of the pruned 


lookdown ancestral selection graph (pruned LD-ASG). Following the same lines, we 
obtain analogue interpretations in the finite case. 
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We fix M C [N] and r > 0, and we consider a realisation of Let 

0 < Ti < • • • < r„ < T be the corresponding coalescent, branching and collision 
times, and set tq := 0, t„+i := t. The LD-ASG is obtained reordering the lines of 
(*) the times r^,. The correspondence between lines in the ASG and levels 
in the LD-ASG is given, for each (3 S [0,t], by a bijective function 

/,M,N , 

[K^ ]. 

The function /3 S [Oj'T"] remains constant on the intervals [r^,Tfc+i). If 

M := {* 1 , *|M|}) with ii < ••• < im, we set 7ro(*/c) ;= k. In addition, if we 

have constructed the LD-ASG in the interval [0,Tfc), with k < n, we extend its 
construction to [r/^^Tk+i) as follows (see Fig. |^: 

• Coalescence: if at time Tk a neutral arrow goes from i to j in A^’^ (*), we draw a 

neutral arrow going from (?) (j) to (i) (j). The line i is placed 

at level Tr^^-{i) A •7r,-^-(j). The lines above 7r.,-^-(f) V are shifted one level 

downwards, and the others keep their positions, i.e. := 7r^j,-(i) A 

TTr^{e) = Trr^_{£)-1 if > TTr^-(i) VTT^^-(j), and TTr^ii) = Otherwise. 

• Branching: if at time Tk the line i branches into the lines i and j in 

an horizontal open arrowhead appears at level TT^^-(i). The incoming branch j 
emanates from it, and all the lines at levels r > 7r,-,,-(i) are shifted one level 
upwards, i.e. TTr^{j) := TTr^-{i), TTr^i^) ■= + 1 if and 

TTr^{£) := 7r,-^-(^) otherwise. 

• Collision: if at time the line i collides the line j in Aljg (.j (*), and TT^^-(i) < 
'^Tk-{j)i ’'^6 set Trf. := T^Tk--: and we draw a selective arrow from Trk{i) to TrT^.{j). 

• Exchange-collision: if at time Tk the line i collides with the line j in 

and TT^^-(i) > 7r^,,-(j), the line i takes the position of line j, all the lines at 
levels 7r,-,^-(j) < r < 7r.rfc-(*) are shifted one level upwards, and the positions of 
the other lines remain unchanged , i.e ^Tkii) ■= ^Tfc(^) := '^Tk-W + 1 if 

TTk-ij) < '^Tk-i^) < and TrT-k(£) := 7r,-,,-(t') otherwise. In addition, an 

horizontal open arrowhead appears at level 'Kr^(i). 

• Mutations: a mutation event in the ASG is pasted at the corresponding level in 
the LD-ASG. 

The resulting object is called the lookdown ancestral selection graph in [0,t] of the 


./ 

" \. 

■ ^ 


/ 




t . . . A 

Coalescence 

Branching 

Collision 

Exchange-collision 


Figure 8. Coalescence, branching and collisions in the LD-ASG 


rM,N 


sample M (see Fig. |^. We denote it by 

In what follows, we focus on the case M = {!}. In this case, has exactly 

one immune path, which can be identified in J.j^. However, we denote by im¬ 
mune line a slightly different object. The immune line in the LD-ASG is the line 
which is at any time the ancestral line if all the lines at that time are of type one. 

Now, we proceed to prune the LD-ASG. We read off from time ,5 = 0 to 

(3 = T (from left to right in Fig. using the following rules (see Fig. 10). If we 
encounter a mutation to type 0 at the immune line, we don’t do anything. If we 
encounter a mutation to type 0 at an occupied level i different from the immune 
line (i.e. i is till now an almost neutral path), we insert at this time a barrier from 
level i till level N, and we kill all the lines above the level i. If we meet a mutation 
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Figure 9. The LD-ASG (below) corresponding to the untyped ASG (above). Time runs 
from right to left. 

to type 1 at an occupied level i different to the immune line (i.e. t is a fictitious 
path), we kill the line i and we shift all the lines above one level downwards. If we 
meet a mutation to type 1 at the immune line, we relocate the immune line to the 
currently highest occupied level, and all the lines which were above the immune line 
are shifted one level downwards. The resulting object is called the pruned LD-ASG 
and denoted by £ . Note that the pruning procedure can transform collisions or 

exchange-collisions in branching events. 


o 




Figure 10. Pruning procedure in the LD-ASG. The effect of the mutations differs in the 
immune line, represented in the figure in bold. 



Figure 11. Pruned LD-ASG corresponding to Fig. The bold line represents the immune 
line. 


Proposition 4.6. The level of the ancestral line at time t in is either the 

lowest level of type 0 at time r or, the level of the immune line if all the lines are 
of type 1 at time t. In particular, the ancestral line is of type 1 at time t if and 
only if all the lines in are of type 1 at time r. 

Proof. If all the lines in are of type 1 at time r, the level of the ancestral line 
is by definition the level of the immune line. Now we assume that at least one of 
the lines in is of type 0 at time r, and we denote by the lowest type-0 level. 
We have to show that i* is also the level of the ancestral line at time r. 

Let (Ji > 0 be the first branching time and ai < a 2 < ■ ■ ■ < (Jm < o" the consecutive 
coalescence, branching, collision, exchange-collision and mutation times in 
We set (Tm-i-i := t. Along the proof, {i,l3) G [N] x [0,r] represents the individual 
at level i at time (3 in If there is at least one line of type 0 in {ak,<Jk+i), we 

denote by ik the level of the lowest type-0 level. 

We claim that, if A: G \m\ is such that there is at least one line of type 0 in 
(Tfc,(Tfc+i), and {ik,Tk) is a descendent of (f*,T), then either {ik,Tk) is over the 
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immune line, mutates to type 1 and all the lines are of type 1 in (ak-iiCrk), or 
{ik-i, Tk-i) is a descendant of (z*, r). If this is true, then starting with k = m+l and 
iterating the claim, we deduce that the unique individual in (0,cri) is a descendant 
of (z*,r), which proves the result. 

We split the proof of the claim depending on the kind of event occurring at 
time Tk- If at time Tk there is a mutation to type I in one of the lines present in 
(o'fc,crj,+i), we have three possibilities: (1) the mutation occurs at a level different 
to ik, then the individual {ik,Tk) is placed at, the maybe different, level ik-i in 
(cTfc-i, (Tfc), (2) the mutation occurs at level ik and ik is not the level of the immune 
line, and the same conclusion holds, and (3) the mutation is at level ik, which is 
also the level of the immune line, then ik is the highest occupied level in (ak, ak+i), 
and therefore in (ak-ijCTk) there are only type-1 lines. In the three cases the claim 
follows. If Tk represents a mutation to type 0, then by construction it is at the 
highest occupied level in ((Tfc,crfe+i), the individual {ik,Tk) remains at the same 
level in (ak-i, (7k), ik = ik-i and the claim follows in this case. If Tk is a branching, 
coalescence, collision or exchange-collision time, then by construction (ik,Tk) is the 
ancestor of {ik-i, 7k-i), and the proof of the claim is completed. □ 


Following a similar procedure as in Section [4.2| we construct in a Markovian way 
the pruned LD-ASG together with the level of its immune line. More precisely, we 
construct a Markov process of the form := ,Tp ,Xp)p>o, where 

• Cp G [A^] represents the number of occupied levels at time (3 in the LD-ASG. 

• := is a collection of counting 

process encoding the mutation, branching, collision, exchange-collision and coales¬ 
cence events. 

• G [A^] represents the level of the immune line at time /3 in the LD-ASG. 

The graphical representation of the process L'^ is obtained by drawing horizontal 
lines at all the occupied lines and interpreting the mutation, branching, collision, 
exchange-collision and coalescence events exactly as in the previous construction of 
the LD-ASG. Due to the exchangeability of the lines, we can read off the dynamics 
of the pruned LD-ASG from the configuration of arrows and circles given by the 
reproduction-mutation process := {Af)t^R. We start with Cq := 1, Tq := 0 
and := 1 and we proceed as follows: 

• if t = —/3 is a jump of the process and j < C^_, we have three options: 


* if z > Cp--. a branching occurs at level j. We set Cp = -I- 1 and 

r^’^(/3) = + 1. In addition, if j < 1^-, we set = Ip_ + 1, 

otherwise we set = Xp_. 

■k if z < j, a collision occurs between z and j. We set = C^-, = 

r^;f(r) + l,andJ^=J^_. 

* if j < z, an exchange collision occurs between z and j. We set 

and r^f (/3) = r;f (/3-) + 1. If X^_ = i, we set X^ = j. If X^_ = j, then 
Xp = j + \. Otherwise, we set Xp = Xp_. 


• If t = —p is a jump of Afand i V j < Cp_: a coalescence occurs between 

the levels z and j. Denoting, k = i A j and £ = i\/ j, we set Cp = — 1 and 

rtf (/5) = rtf (/?■) + 1- If 2 :^- > then x^ = X^_ - 1. If = e, then X^ = k. 
Otherwise, we set Xp = Xp_. 

• if f = —/3 is a jump of A°’'^ and z < : a mutation to type 0 occurs at level 

i. We set rtf (^) = rl’eip-) + 1. If If. = i, then and /if = /if.. If 
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Ip- ^ i, then Cp = i, and Ip = i ii Ip_ > i or Ip = Ip_ otherwise. 

• if t = —/3 is a jump of and i < a mutation to type 1 occurs at level 

i. We set rlfiP) = + 1. If = i, then I^ = L^_ and = C^-. If 

Ip- yf i, then = Cp- — 1 and Ip = Ip- — 1 if Ip- > i or Ip = Ip- otherwise. 

• other jumps in are ignored. 

From construction, the line-counting process = (£^)^>o is a continuous-time 
Markov chain with state space [N] and transition rates given by 




i {N — i) N~^ s 

i{i — 1)N~^ + {i — 1) uvi + wq 

UVq 

i {{N — i)s + {i — 1)) N~^ -I- (* — 1)' 

0 


if j = * -k 1, 

if j = i - 1, 

if j e z-2}, 

if j = i, 

otherwise, 


(4.7) 

for i,jG [iV]. Since is irreducible and its state space is finite, it has a unique 
stationary distribution, denoted by . Let be a random variable distributed 
as p^. The next result gives a new graphical interpretation to the expression (2.101. 

Proposition 4.7. We have 


N-k 




■= ]v E > ^) n 


n—O 


j=0 
N 


N-k-j 
N-l-j- 


In addition, if we define, for each n G [IV]o, a„ := P{£^ > n), then (a„ )„=o 
satisfies (En) for all n G [IVJq. 

Proof. The first part of the statement follows from Proposition |4.6| using similar 
arguments as in Lemma |4.5| 


From definition P{C^ > 0) = 1 and hence (Eq) is satisfied. Using the first 
statement for A: = iV — 1 we deduce that {Ei I holds. Furthermore, since p^ is the 
stationary distribution of , we have p^Qc" = 0, where Qc^ is the generator of 
. In particular, for n G — 2 ( 2 , the n-th cequation in p^Qc" = 0 reads 


Pn-l 


{n — l)(fV — n -I- 1) 


= p; 


N 


N 

n{n — 1) 
N 


Pn+1 


i{n + 1) 

”“/v 


nwi + uvq 


j=n+2 


, . n[N — n) / „\ 

+ [n — l)wi + uvq -\ -—- s + ui 2 o{n — 2) 


Reordering the terms we obtain 


in-l) p‘:_. 


N-n+1 

N 


N 


N N 
-uvoa^ -p„ 


n 

IN 


NT + M 


( pN 

N — n 


n+l 

\ Pn 

N 

^^O^n+l rn+1 

N 


(4.8) 


In a similar way, we derive from the first equation in p^ Qc" = 0 that 


Pi 


r^-1 1 

N N 

\ 2 1 

N 

-uvoa^ - P 2 



= 0 . 


Using, (4.8 1 and (4.9), we deduce, for all n g[N — 2], that 


„N 


N 

N 


N 

UVoan+1 


Pn+1 


n+l 

N 


= 0. 


(4.9) 


It is straightforward to see that the previous equation is equivalent to ( En ). It 


remains only to prove that the missing equation (E+f] holds. The latter is easily 
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obtained from the last equation in Qc’^ = 0, which is given by 

P%.,^ = pU^ + u). 

□ 


5. The asymptotic common ancestor type distribution and the 
ASYMPTOTIC PRUNED LD-ASG 


In this section we aim to give a probabilistic interpretation to the function h ap¬ 
pearing in Theorem |3.13| To do so, we construct, in the setting of the deterministic 
limit, an asymptotic version of the pruned LD-ASG. 

Let us first study the asymptotic behaviour of . It is straightforward to see 
that, for any *, j S [A^]o, qc"{hj) — -^ qcihj), where qc is defined by 


qc{i,j) 


i s 

{i — 1) u 1^1 + uvq 

< UVq 

u — {s + u)i 

0 

v 


a j = i + i, 

if j = z -1, 

if j G {l,...,n-2}, 

if j = i, 

otherwise. 


(5.1) 


Let C = be the continuous-time Markov chain corresponding to the tran¬ 

sition rates given in (5.11 starting at Cq := 1. We refer to C as the asymptotic 
line-counting process. 


Lemma 5.1. The process C is non-explosive. 

Proof. From Proposition 8.7.2] the non-explosive condition is equivalent to 


E 


qci^Sk) 


= oo a.s.. 


where So < Si <•■ ■ denote the jump times of C and gc(n) := —qc{n,n). In 
addition, we have Cs^ <k-\-2. Therefore, 


CXJ ^ tXJ ^ 

This concludes the proof. 


^E 

k=0 


1 

(u -I- s)(fc -I- 2) — u 


= oo a.s.. 


□ 


The next proposition formalises the convergence of to C. 

Proposition 5.2. The sequence of line-counting processes {C^)pj->i converges in 
distribution to the line-counting process C. 


Proof. We denote by (Doo,ci^) the space of cadlag functions on [0,oo) with values 
on N equiped with the metric defined in Appendix [A| i.e. with the Skorohod 
topology. We have to show that, for all uniformly continuous and bounded function 
F : Doo —t R, limAr_>oo = E[F{C)]. Let F be such a function and fix 

fc G N. Note that for every N > k 


E [F(/:^)] 




E[E{C)] \ <E^F{C^)-F\ 

+ \E [F (T.ATfe(£«))] - E[F (T.ATfc(z;))] 

+ E[\F{C.i,T,ic))-F{C)\], 


(5.2) 


where is the function defined in Appendix [ a| Note that Tk{C^) and Tk{C) are 
a.s. finite. We denote by E\^ Ej^ ^ and E^ respectively the first, second and third 
term on the right-hand side of (|5.2[) . 
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The processes C^{k) := and £(A:) := (£tATfc(£))t>o are continuous¬ 

time Markov chains with space state [k]. Moreover, it is straightforward to see that 
the transition rates of £*^>^ converge to the transition rates of £*^. This in turn 
implies, for every real-valued, bounded function on [fc], that 

lim sup |^£«(fc)/(i) - Ac{k)f{i)\ = 0, 

where and A^^k) denote the generators of C^{k) and £(fc). From [5J Theo¬ 

rems 1.6.1 and 4.2.11] we get that [k) converges in distribution to £(fc). Thus, 

hm = 0. (5.3) 

Fix e e (0,1). Since, F is uniformly continuous, there is 5f{£) € (0,£) such that 


d°^{w, IP*) < (5(e) \F{w) — F(ry»)| < e. 

As a consequence of this and Lemma EH we deduce that 

El^ <e + 2\\F\\^P{Tk{C^) < nF{e)), (5.4) 

where UFi^) ■= Llog 2 ( 5 (^)J + 1- Similarly, we get 

El <e + 2\\F\\^P{Tk{C) < nF{s)). (5.5) 

Note that Tk{C^) = Tk{C^{k)) and Tk{C) = Tk{C{k)). Since {k) converges 
in distribution to £(fc) and the function T^. is continuous (see Lemma A.21, we 
deduce from the mapping theorem (see |3] Theorem 2.7]) that Tk{C^) converges in 
distribution to Tk{C). In particular, from ( |5.4[ ), we get 

limsupAfc jv < e-k 2llFlloo£('£fe(£) < nF(e))- (5.6) 

N—^oo 


Using (5.31,(5.5) and ( |5.6[ ), we obtain 

limsup|£[£(£'^)] -E[F{C)]\ < 2£ + 4llFlloo£(rfe(£) < UFie)). 

N^oo 


Lemma 5.1 implies that the last term converge to zero when k tends to 00 . Sum¬ 
marizing, we have proven that, for all £ G (0,1), 


lim sup I £ [F(£^)] - £ [£(£)] I < 2£. 

N^oo 


The result follows. 


□ 


Lemma 5.3. The process £ has a unique stationary distribution, which is given 
by the geometric distribution of parameter 1 — 


Proof. If p := {pk)k>i denotes the geometric distribution of parameter 1 — one 
can easily check that p is a stationary distribution for Qc, i.e pQc = 0- Since Qc 
is irreducible and non-explosive, we deduce from [?n Theorem 3.5.3] that Qc is 
positive recurrent. Therefore, the uniqueness of the stationary distribution follows 
from Theorem 3.5.2]. □ 


Let £^ and £00 be rando m vari ables following the stationary distributions £^ 

3.12 can be translated in terms of Cfi and £00 as 


and £, respectively. Corollary 


£: 


N—^oo 


-> £n 


(5.7) 


Using this and Proposition |4.7| we recover Theorem |3.13|in the following form 


^ 77-(0(1) 

iV N—^00 




N^oo 


> ^ £(£00 > n)x{l — x)" = h{x). 


n—0 


Now, we aim to construct an asymptotic version of the pruned LD-ASG having 
£ as line-counting process. Note that the coalescence and collision rates in the 
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finite ASG, and the LD-ASG, converge to zero when the population size tends to 
infinity. In particular, collisions and coalescence events will be absent in any suitable 
asymptotic version of the pruned LD-ASG. Thus, the notion of common ancestor 
does not make sense anymore. Nevertheless, we provide a nice interpretation to the 
function h{x) at the end of this section. 

We turn now to the construction of the asymptotic pruned LD-ASG. We point 
out first that the abscence of coalescence implies that the immune line will be always 
located at the highest occupied level. 

We start with a realisation of started at Cq = 1. If ti > 0 is the 

first jumping time of £, then the asymptotic pruned LD-ASG in [0 ,ti) consists of 
a single individual placed at level 1. Moreover, the asymptotic pruned LD-ASG 
remains constant on intervals of the form [r, r,), where t < r* are two consecutive 
jumps of L. It remains to describe the evolution at the jumping times of £. To do 
so, assume we have constructed the asymptotic pruned LD-ASG in [0, r) and that 
T is a jumping time of £ 

• If £r — £r- = Ij then a branching event occurs. A star appears at a level 
i chosen between the Cr- current lines. The incoming branch emanates 
from the star and all the lines at levels k>i are shifted one level upwards. 

• If £,- — £^- = —1 and £_- = n, with probability -— a mutation 

to type 0 occurs at level n — 1, we insert an infinite vertical barrier starting 
from level n — 1 and we kill all the lines above level n — 1. If no mutation 
to type 0 happens, then a mutation to type 1 takes place at a level i < n 
chosen at random, the corresponding line is killed and we shift all the lines 
above one level downwards. 

• If £t — £r- < — 1 and Cr = n, a mutation to type 0 occurs at level n, 
we insert a vertical barrier from level n till infinity and we kill all the lines 
above level n. 

Now we introduce a new notion which plays the role of the common ancestor in the 
deteministic limit setting. We start at time /3 = 0 with a generic individual in the 
population, and we trace back the type of its ancestor. Furthermore, the type of 
the ancestor at time /3 of the chosen individual is denoted by and is called the 
representative ancestral type at time /3. 

Remark 5.4. In the finite case, after the first botleneck, the representative ancestral 
type coincides with the common ancestor type. 

Proposition 5.5. The representative ancestral type is 1 at time P if and only if 
all the lines in the asymptotic pruned LD-ASG at time /3 are of type 1. 

Proof. The proof is analogous to the proof of Proposition |4.6| □ 

Let G {0,1} be the type that is assigned at time /3 to the individual placed 
at level k G {l,...,Lp} in the asymptotic pruned LD-ASG. We assume that the 
asymptotic pruned LD-ASG is constructed under a probability Px such that the 
types are assigned in an i.i.d. manner with Px{J^ = 0) = x. The following result 
provides a probabilistic meaning to the function h. 

Corollary 5.6. For all x G (0,oo), h{x) = lim Px{Ig = 0). 

Proof. From Proposition |5.5| we see that: 

CXD 

Px{i; = 0) = ^ = 0,4 = IVfc <n,Cfs> n) 

n—0 

oo oo 

= II = 0, J| = 1 Vfc < n) Px{Cp > n) = ^ a;(l - x^PxiCp > n). 

n—0 n—0 
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Taking the limit when (3 tends to infinity in both sides and using (5.71, we get 



The result follows from the definition of h and Lemma 


EM 


□ 


Remark 5.7. Assume that we are in the stationary regime, i.e. under P^+, then the 
representative ancestral type coincides with the population average of the ancestral 
types in a related branching model (see HOI Theorem 3.1]/ This is formalised in 
Lemma \B.1\. 


Appendix A. Some remarks on the Skorohod topology in Dn[0,oo) 


For each t G (0, oo), we denote by D* := Dn[0, t] the space of cadlag functions on 
[0,t] with values on N. We start by recalling the Skorohod topology in D(. 

Let Cj be the class of strictly increasing, continuous functions from [0,t] onto 
itself. For A G Cj, we set 


;= sup 


log 


A(s) — A(u) 


s — u 


Now, we define the metric in as follows: 

d°tU,9) ■= inf {||A||° V \\f - g o 
\ec] 


where ||/||t,oo := supsg[g_(] |/(s)|. The metric d^ gives the Skorohod topology in Dj. 

Similarly, we denote by Dec := Dn[0,oo) the space of cadlag functions on [0,oo) 
with values on N. Let C/, be the class of increasing, continuous functions from 
[0, oo) onto itself. 

Now, we define, for each to G N, the function Xm by setting Xmis) := 1 for 
0 < s < TO — 1, Xm(s) := TO — s for TO — 1 < s < TO, and Xm(s) := 0 for s > to. 
Finally, we define the metric d^ in Dqo as follows: 


C(/,5) 


E 


1 A d^ {xjn f , g) 
' ' *2™ ’ 


The metric d%^ gives the Skorohod topology in Doo. 


Lemma A.l. For all w G Doo cind t G [0, oo), we have 


dlo{w,w{- At)) < 

Proof. Note first that for all m <t, d°^{xm w, Xm w{- A t)) = 0. Therefore, we have 




m— [ij +1 


m— [ij +1 


1 

2lA ■ 


□ 


Now, given fc G N and w G Dqo, we define Tk(w) := inf{s > 0 : w{s) = k} and 
Doo(/c) := {w G Doo : Tk{w) < oo}. 

Lemma A.2. The function T^ : Doo (A:) [0,oo) is continuous. 

Proof. Let us consider {rCnjraGN C Doo(fc) and w G Doo(fc) such that 


n—¥oo 










28 


F. CORDERO 


We have to show that Tk{wn) - Tk{w). From [31 Theorem 16.1], there is a 

n—^oo 

sequence {A„}„gN C such that 

sup |A„(t) — t\ -0 and sup |w„(A„(t)) — w(t)| -0, 

t<oo n—yoo t<m n—>-oo 

for all TO G N. Therefore, defining to/j := Tk{w) + 1, we deduce that for all e > 0, 
there is no(e) G N, such that, for all n > no(e): 

sup |A„(t) —1| < £ and sup |?«„(A„(f)) — w{t)\ < £. 

t<oo t<m 

This implies for e G (0,1) that for all n > no(e) 

sup \Wn{Xn{t)) - w{t)\ = 0. 

t<mk 

Therefore, w„(A„(Tfc(r(;))) = k and, for each t < Tk{w), r(;„(A„(t)) ^ k. This 
implies that Xn{Tk{w)) = Tk{wn)- Consequently, we have 

\Tk{wn) - Tk{w)\ = \Xn{Tk{w)) - Tk{w)\ < e. 

The continuity of follows. □ 


Appendix B. A related 2-type branching process 

Assume that u > 0 and s > 0. We consider a population composed of individuals 
of two types, 0 and 1, evolving in the following way. Each individual of type 0 waits 
for an exponential time with parameter 1-|-s-|-mpi, and then splits or mutates to type 
1 with probabilities (1-|-s)/(1-|-s-|-mi^i) and to^i/(1-|-s-|-wi/i), respectively. On the 
other hand, an individual of type 1 waits for an exponential time with parameter 
1 -|- uvq, and then splits or mutate to type 0 with probabilities 1/(1 -I- uvq) and 
MPo/(l + uiyg), respectively. In other words, the population evolves according to a 
2-type branching process in continuous time. We summarize here the asymptotic 
properties of this model (see e.g. PQ and 0). 

In this framework, consider the process Y = where F/, j G {0,1} 

denotes the number of individuals of type j at time t. Additionally, we denote by 
Ei[Yf] the expected number of /-individuals at time t in a population started by 
a single i-individual at time 0. It is well known that Ei[Y^] = where A is 

the matrix 


A:= 


1 -I- s — Wi 

UVq 


UVi 

1 — UVq 


(B.l) 


The asymptotic properties of Y are expressed in terms of the largest eigenvalue 
of the matrix A, A+ = 1-1- sx]}, and the corresponding, properly normalized, left 
and right eigenvectors. Additionally, if tt := (ttojTTi) denotes the left eigenvector 
associated to A+ normalized such that ttq + 'Kx = 1, then tiq := Xq and tti := 1 —a;]}. 
The right eigenvector of A associated to A+, h = (ho, hi), normalized such that 
hoT^o + hiTTi = 1 is given by 


hn — 


Wo 


■ SXn 


and hi = 


Wo 


Wo 


Wo 


We know from (2.6 1 that Xq is the asymptotic proportion of individuals of type 


0 in the deterministic 2-type selection mutation model. The same interpretation 


for x]} holds in the branching model, since 


yO 

lim = X, 

t^oo yo y 1 


+ 
0 > 


conditionally on survival. 
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Moreover, we have 

In (i>o + f/) 

lim -= 1 + sXq , conditionally on survival, 

t —^OO "t 


i.e. A+ is the asymptotic growth rate of the population. In addition, hi, for i S 
{0,1}, measures the asymptotic mean offspring size of an i individual, relative to 
the total size of the population. More precisely, we have 


lim 

t—¥(X) 


Y° + y 1 




= hi 


Finally, the vector a := (ooi <ti) given by ai := hiiTi, for i G (0,1}, describes the 
population average of the ancestral types and is called the ancestral distribution. 


Lemma B.l (representative ancestral type and ancestral distribution). We have 

- A + (s + m)-\/A h^Xg) 

2A 

where A := (s — + ^uvqs. In particular, we have h{xQ) = ag, i.e. the represen¬ 

tative ancestral type distribution starting with a stationary configuration equals the 
ancestral distribution in the 2-type branching model described here. 

Proof. From the definition of x^ and A, we have 2(uiyo-\- sXq) = 2muo + s —t6 + -\/A, 
and that 2s{uvq + sx)} ) = A + (s — u)^fK. Thus, we obtain 

- s{2ui'o-\-s — u-\-VA) s{2uvo + s — u-\-V A) A — {s — u)\/A 

A + (s — m)\/A a — {s — u)\/A a — {s — u)\/A' 

and the first identity follows after simplifications. In order to obtain the second 
identity, we note that, from the relation between £_ and 

2suui(l — i-{l — Xq)) = 2s(2mui — u — s + (s + = —A + (s + u)'/A. 

As a consequence, we have 

h{xQ) 2sui'i 

Xq —A T (s T u)y/~A 

and the second identity follows using similar arguments as before. □ 


Acknowledgements. I would like to thank Ellen Baake for bringing this problem 
to my attention and for enlightening the development of this work. I am also grate¬ 
ful to Tom Kurtz and Anton Wakolbinger for stimulating and fruitful discussions. 
This project received financial support from the Priority Programme Probabilistic 
Structures in Evolution (SPP 1590), which is funded by Deutsche Forschungsge- 
meinschaft. 


References 

[1] Baake, E., and Georgii, H. Mutation, selection, and ancestry in branching models: a 
variational approach. J. Math. Biol. 54-, 2 (2007), 257-303. 

[2] Bialowons, R., and Baake, E. Ancestral processes with selection: Branching and Moran 
models. Banach Center Publications 80 (2008), 33—52. 

[3] Billingsley, P. Convergence of probability measures, second ed. Wiley Series in Probability 
and Statistics: Probability and Statistics. John Wiley Sz Sons, Inc., New York, 1999. A 
Wiley-Interscience Publication. 

[4] Cordero, F. The deterministic limit of the moran model: a uniform central limit theorem. 
http://arxiv.org/abs/1508.05231 (2015). 

[5] Crow, J., and Kimura, M. An Introduction to Population Genetics Theory. Harper and 
Row, New York, 1970. 

[6] Durrett, R. Probability models for DNA sequence evolution, second ed. Probability and its 
Applications (New York). Springer, New York, 2008. 



30 


F. CORDERO 


[7] Etheridge, A. Some mathematical models from population genetics, vol. 2012 of Lecture 
Notes in Mathematics. Springer, Heidelberg, 2011. Lectures from the 39th Probability Sum¬ 
mer School held in Saint-Flour, 2009. 

[8] Ethier, S. N., and Kurtz, T. G. Markov processes. Wiley Series in Probability and Math¬ 
ematical Statistics: Probability and Mathematical Statistics. John Wiley Sons, Inc., New 
York, 1986. Characterization and convergence. 

[9] Fearnhead, P. The common ancestor at a nonneutral locus. Journal of Applied Probability 
39, 1 (2002), 38-54. 

[10] GeorGII, H.-O., and Baake, E. Supercritical multitype branching processes: the ancestral 
types of typical individuals. Adv. in Appl. Probab. 35, 4 (2003), 1090-1110. 

[11] Horn, R. A., and Johnson, C. R. Matrix analysis. Cambridge University Press, Cambridge, 
1985. 

[12] Kimura, M. On the probability of fixation of mutant genes in a population. Genetics fl (jun 
1962), 713-719. 

[13] Kluth, S., and Baake, E. The moran model with selection: Fixation probabilities, ancestral 
lines, and an alternative particle representation. Theoretical Population Biology 90, 0 (2013), 
104-112. 

[14] Kluth, S., Hustedt, T., and Baake, E. The common ancestor process revisited. Bull. 
Math. Biol. 75, 11 (2013), 2003-2027. 

[15] Krone, S. M., and Neuhauser, C. Ancestral processes with selection. Theoretical Population 
Biology 51, 3 (1997), 210-237. 

[16] Kurtz, T. G. Limit theorems and diffusion approximations for density dependent Markov 
chains. Math. Programming Stud., 5 (1976), 67-78. Stochastic systems: modeling, identifica¬ 
tion and optimization, I (Proc. Sympos., Univ. Kentucky, Lexington, Ky., 1975). 

[17] Kurtz, T. G. Approximation of population processes, vol. 36 of CBMS-NSF Regional Confer¬ 
ence Series in Applied Mathematics. Society for Industrial and Applied Mathematics (SIAM), 
Philadelphia, Pa., 1981. 

[18] Lenz, U., Kluth, S., Baake, E., and Wakolbinger, A. Looking down in the ancestral selec¬ 
tion graph: A probabilistic approach to the common ancestor type distribution. Theoretical 
Population Biology 103 (2015), 27-37. 

[19] Mano, S. Duality, ancestral and diffusion processes in models with selection. Theoretical 
Population Biology 75, 2-3 (2009), 164-175. 

[20] Neuhauser, C., and Krone, S. The genealogy of samples in models with selection. Genetics 
145 , 2 (February 1997), 519-534. 

[21] Norris, J. R. Markov chains, vol. 2 of Cambridge Series in Statistical and Probabilistic 
Mathematics. Cambridge University Press, Cambridge, 1998. Reprint of 1997 original. 

[22] POKALYUK, C., AND Pfaffelhuber, P. The ancestral selection graph under strong directional 
selection. Theoretical Population Biology 87 (2013), 25-33. Coalescent Theory. 

[23] RioRDAN, J. Inverse relations and combinatorial identities. Amer. Math. Monthly 71 (1964), 
485-498. 

[24] Taylor, J. E. The common ancestor process for a Wright-Fisher diffusion. Electron. J. 
Probab. 12 (2007), no. 28, 808-847. 

[25] TodorOVIC, P. An introduction to stochastic processes and their applications. Springer Series 
in Statistics: Probability and its Applications. Springer-Verlag, New York, 1992. 

[26] USMANI, R. A. Inversion of Jacobi’s tridiagonal matrix. Comput. Math. Appl. 27, 8 (1994), 
59-66. 



